Overview

Dataset statistics

Number of variables44
Number of observations53216
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory105.2 MiB
Average record size in memory2.0 KiB

Variable types

CAT32
NUM11
BOOL1

Warnings

glimepiride-pioglitazone has constant value "53216" Constant
medical_specialty has a high cardinality: 68 distinct values High cardinality
diag_1 has a high cardinality: 671 distinct values High cardinality
diag_2 has a high cardinality: 697 distinct values High cardinality
diag_3 has a high cardinality: 724 distinct values High cardinality
number_emergency is highly skewed (γ1 = 27.22616943) Skewed
num_procedures has 22882 (43.0%) zeros Zeros
number_outpatient has 46621 (87.6%) zeros Zeros
number_emergency has 49784 (93.6%) zeros Zeros
number_inpatient has 47199 (88.7%) zeros Zeros

Reproduction

Analysis started2020-09-13 00:02:14.125044
Analysis finished2020-09-13 00:02:37.853715
Duration23.73 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

race
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
Caucasian
41354 
AfricanAmerican
9295 
Hispanic
 
1204
Other
 
946
Asian
 
417
ValueCountFrequency (%) 
Caucasian4135477.7%
 
AfricanAmerican929517.5%
 
Hispanic12042.3%
 
Other9461.8%
 
Asian4170.8%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length15
Median length9
Mean length9.922917919
Min length5

Overview of Unicode Properties

Unique unicode characters17
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a14427327.3%
 
i6276911.9%
 
n6156511.7%
 
c6114811.6%
 
s429758.1%
 
C413547.8%
 
u413547.8%
 
r195363.7%
 
A190073.6%
 
e102411.9%
 
f92951.8%
 
m92951.8%
 
H12040.2%
 
p12040.2%
 
O9460.2%
 
t9460.2%
 
h9460.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter46554788.2%
 
Uppercase Letter6251111.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C4135466.2%
 
A1900730.4%
 
H12041.9%
 
O9461.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a14427331.0%
 
i6276913.5%
 
n6156513.2%
 
c6114813.1%
 
s429759.2%
 
u413548.9%
 
r195364.2%
 
e102412.2%
 
f92952.0%
 
m92952.0%
 
p12040.3%
 
t9460.2%
 
h9460.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin528058100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a14427327.3%
 
i6276911.9%
 
n6156511.7%
 
c6114811.6%
 
s429758.1%
 
C413547.8%
 
u413547.8%
 
r195363.7%
 
A190073.6%
 
e102411.9%
 
f92951.8%
 
m92951.8%
 
H12040.2%
 
p12040.2%
 
O9460.2%
 
t9460.2%
 
h9460.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII528058100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a14427327.3%
 
i6276911.9%
 
n6156511.7%
 
c6114811.6%
 
s429758.1%
 
C413547.8%
 
u413547.8%
 
r195363.7%
 
A190073.6%
 
e102411.9%
 
f92951.8%
 
m92951.8%
 
H12040.2%
 
p12040.2%
 
O9460.2%
 
t9460.2%
 
h9460.2%
 

gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
Female
28040 
Male
25176 
ValueCountFrequency (%) 
Female2804052.7%
 
Male2517647.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length6
Mean length5.0538184
Min length4

Overview of Unicode Properties

Unique unicode characters6
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e8125630.2%
 
a5321619.8%
 
l5321619.8%
 
F2804010.4%
 
m2804010.4%
 
M251769.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter21572880.2%
 
Uppercase Letter5321619.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
F2804052.7%
 
M2517647.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e8125637.7%
 
a5321624.7%
 
l5321624.7%
 
m2804013.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin268944100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e8125630.2%
 
a5321619.8%
 
l5321619.8%
 
F2804010.4%
 
m2804010.4%
 
M251769.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII268944100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e8125630.2%
 
a5321619.8%
 
l5321619.8%
 
F2804010.4%
 
m2804010.4%
 
M251769.4%
 

age
Categorical

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
[70-80)
13156 
[60-70)
12037 
[50-60)
9631 
[80-90)
8093 
[40-50)
5319 
Other values (5)
4980 
ValueCountFrequency (%) 
[70-80)1315624.7%
 
[60-70)1203722.6%
 
[50-60)963118.1%
 
[80-90)809315.2%
 
[40-50)531910.0%
 
[30-40)21354.0%
 
[90-100)13632.6%
 
[20-30)8931.7%
 
[10-20)4440.8%
 
[0-10)1450.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length8
Median length7
Mean length7.022887853
Min length6

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories4 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
010779528.8%
 
[5321614.2%
 
-5321614.2%
 
)5321614.2%
 
7251936.7%
 
6216685.8%
 
8212495.7%
 
5149504.0%
 
994562.5%
 
474542.0%
 
330280.8%
 
119520.5%
 
213370.4%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number21408257.3%
 
Open Punctuation5321614.2%
 
Dash Punctuation5321614.2%
 
Close Punctuation5321614.2%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
[53216100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
010779550.4%
 
72519311.8%
 
62166810.1%
 
8212499.9%
 
5149507.0%
 
994564.4%
 
474543.5%
 
330281.4%
 
119520.9%
 
213370.6%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-53216100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)53216100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common373730100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
010779528.8%
 
[5321614.2%
 
-5321614.2%
 
)5321614.2%
 
7251936.7%
 
6216685.8%
 
8212495.7%
 
5149504.0%
 
994562.5%
 
474542.0%
 
330280.8%
 
119520.5%
 
213370.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII373730100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
010779528.8%
 
[5321614.2%
 
-5321614.2%
 
)5321614.2%
 
7251936.7%
 
6216685.8%
 
8212495.7%
 
5149504.0%
 
994562.5%
 
474542.0%
 
330280.8%
 
119520.5%
 
213370.4%
 

admission_type_id
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.128626729
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile6
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.529284745
Coefficient of variation (CV)0.7184372553
Kurtosis1.681111817
Mean2.128626729
Median Absolute Deviation (MAD)1
Skewness1.526199488
Sum113277
Variance2.33871183
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%) 
12659350.0%
 
31083720.4%
 
2978418.4%
 
637347.0%
 
519653.7%
 
82790.5%
 
716< 0.1%
 
48< 0.1%
 
ValueCountFrequency (%) 
12659350.0%
 
2978418.4%
 
31083720.4%
 
48< 0.1%
 
519653.7%
 
637347.0%
 
716< 0.1%
 
82790.5%
 
ValueCountFrequency (%) 
82790.5%
 
716< 0.1%
 
637347.0%
 
519653.7%
 
48< 0.1%
 
31083720.4%
 
2978418.4%
 
12659350.0%
 

discharge_disposition_id
Real number (ℝ≥0)

Distinct21
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.233012628
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile18
Maximum28
Range27
Interquartile range (IQR)2

Descriptive statistics

Standard deviation4.948504232
Coefficient of variation (CV)1.530617044
Kurtosis8.464630929
Mean3.233012628
Median Absolute Deviation (MAD)0
Skewness2.991457699
Sum172048
Variance24.48769414
MonotocityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%) 
13470765.2%
 
3631211.9%
 
6607611.4%
 
1817813.3%
 
211572.2%
 
229431.8%
 
56781.3%
 
254820.9%
 
43950.7%
 
73130.6%
 
232030.4%
 
8520.1%
 
28480.1%
 
1523< 0.1%
 
2421< 0.1%
 
98< 0.1%
 
106< 0.1%
 
175< 0.1%
 
273< 0.1%
 
162< 0.1%
 
121< 0.1%
 
ValueCountFrequency (%) 
13470765.2%
 
211572.2%
 
3631211.9%
 
43950.7%
 
56781.3%
 
6607611.4%
 
73130.6%
 
8520.1%
 
98< 0.1%
 
106< 0.1%
 
ValueCountFrequency (%) 
28480.1%
 
273< 0.1%
 
254820.9%
 
2421< 0.1%
 
232030.4%
 
229431.8%
 
1817813.3%
 
175< 0.1%
 
162< 0.1%
 
1523< 0.1%
 

admission_source_id
Real number (ℝ≥0)

Distinct17
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.601792694
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median7
Q37
95-th percentile17
Maximum25
Range24
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.200107355
Coefficient of variation (CV)0.7497791483
Kurtosis1.631310767
Mean5.601792694
Median Absolute Deviation (MAD)0
Skewness1.097455438
Sum298105
Variance17.64090179
MonotocityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%) 
72758651.8%
 
11675831.5%
 
1737217.0%
 
421734.1%
 
614722.8%
 
27821.5%
 
54020.8%
 
201300.2%
 
3870.2%
 
9810.2%
 
89< 0.1%
 
106< 0.1%
 
223< 0.1%
 
142< 0.1%
 
252< 0.1%
 
111< 0.1%
 
131< 0.1%
 
ValueCountFrequency (%) 
11675831.5%
 
27821.5%
 
3870.2%
 
421734.1%
 
54020.8%
 
614722.8%
 
72758651.8%
 
89< 0.1%
 
9810.2%
 
106< 0.1%
 
ValueCountFrequency (%) 
252< 0.1%
 
223< 0.1%
 
201300.2%
 
1737217.0%
 
142< 0.1%
 
131< 0.1%
 
111< 0.1%
 
106< 0.1%
 
9810.2%
 
89< 0.1%
 

time_in_hospital
Real number (ℝ≥0)

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.196144017
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q36
95-th percentile10
Maximum14
Range13
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.915286278
Coefficient of variation (CV)0.6947536278
Kurtosis1.126059328
Mean4.196144017
Median Absolute Deviation (MAD)2
Skewness1.215478421
Sum223302
Variance8.498894085
MonotocityNot monotonic
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
3957018.0%
 
2945117.8%
 
1839015.8%
 
4708413.3%
 
550019.4%
 
637477.0%
 
728485.4%
 
820663.9%
 
913762.6%
 
1010932.1%
 
118931.7%
 
126541.2%
 
135741.1%
 
144690.9%
 
ValueCountFrequency (%) 
1839015.8%
 
2945117.8%
 
3957018.0%
 
4708413.3%
 
550019.4%
 
637477.0%
 
728485.4%
 
820663.9%
 
913762.6%
 
1010932.1%
 
ValueCountFrequency (%) 
144690.9%
 
135741.1%
 
126541.2%
 
118931.7%
 
1010932.1%
 
913762.6%
 
820663.9%
 
728485.4%
 
637477.0%
 
550019.4%
 

medical_specialty
Categorical

HIGH CARDINALITY

Distinct68
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
Unknow
25171 
InternalMedicine
8535 
Family/GeneralPractice
3661 
Cardiology
3406 
Emergency/Trauma
2885 
Other values (63)
9558 
ValueCountFrequency (%) 
Unknow2517147.3%
 
InternalMedicine853516.0%
 
Family/GeneralPractice36616.9%
 
Cardiology34066.4%
 
Emergency/Trauma28855.4%
 
Surgery-General17593.3%
 
Orthopedics-Reconstructive9141.7%
 
Orthopedics8731.6%
 
Radiologist6111.1%
 
ObstetricsandGynecology5321.0%
 
Pulmonology5091.0%
 
Nephrology4830.9%
 
Psychiatry4720.9%
 
Urology4180.8%
 
Surgery-Cardiovascular/Thoracic3920.7%
 
Surgery-Neuro3590.7%
 
Gastroenterology2770.5%
 
Surgery-Vascular2650.5%
 
Pediatrics1630.3%
 
PhysicalMedicineandRehabilitation1490.3%
 
Oncology1460.3%
 
Neurology1400.3%
 
Pediatrics-Endocrinology1370.3%
 
Otolaryngology940.2%
 
Surgery-Thoracic830.2%
 
Other values (43)7821.5%
 
Frequencies of value counts

Unique

Unique7 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length36
Median length10
Mean length11.22600346
Min length6

Overview of Unicode Properties

Unique unicode characters43
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n8843014.8%
 
e564349.4%
 
o450767.5%
 
r410856.9%
 
a363606.1%
 
i360316.0%
 
c270674.5%
 
l267004.5%
 
U255894.3%
 
k251714.2%
 
w251714.2%
 
t198133.3%
 
y180393.0%
 
d163492.7%
 
g132402.2%
 
m100831.7%
 
M87011.5%
 
u86381.4%
 
I85631.4%
 
/70071.2%
 
s67441.1%
 
G63081.1%
 
P53060.9%
 
-41460.7%
 
C40120.7%
 
Other values (18)273404.6%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter50865885.1%
 
Uppercase Letter7757113.0%
 
Other Punctuation70281.2%
 
Dash Punctuation41460.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
U2558933.0%
 
M870111.2%
 
I856311.0%
 
G63088.1%
 
P53066.8%
 
C40125.2%
 
F36674.7%
 
T33634.3%
 
E31124.0%
 
S30654.0%
 
O27403.5%
 
R17172.2%
 
N9931.3%
 
V2650.3%
 
H1090.1%
 
A32< 0.1%
 
D29< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n8843017.4%
 
e5643411.1%
 
o450768.9%
 
r410858.1%
 
a363607.1%
 
i360317.1%
 
c270675.3%
 
l267005.2%
 
k251714.9%
 
w251714.9%
 
t198133.9%
 
y180393.5%
 
d163493.2%
 
g132402.6%
 
m100832.0%
 
u86381.7%
 
s67441.3%
 
h37030.7%
 
p23810.5%
 
v13920.3%
 
b7160.1%
 
f29< 0.1%
 
x6< 0.1%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-4146100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/700799.7%
 
&210.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin58622998.1%
 
Common111741.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n8843015.1%
 
e564349.6%
 
o450767.7%
 
r410857.0%
 
a363606.2%
 
i360316.1%
 
c270674.6%
 
l267004.6%
 
U255894.4%
 
k251714.3%
 
w251714.3%
 
t198133.4%
 
y180393.1%
 
d163492.8%
 
g132402.3%
 
m100831.7%
 
M87011.5%
 
u86381.5%
 
I85631.5%
 
s67441.2%
 
G63081.1%
 
P53060.9%
 
C40120.7%
 
h37030.6%
 
F36670.6%
 
Other values (15)199493.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
/700762.7%
 
-414637.1%
 
&210.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII597403100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n8843014.8%
 
e564349.4%
 
o450767.5%
 
r410856.9%
 
a363606.1%
 
i360316.0%
 
c270674.5%
 
l267004.5%
 
U255894.3%
 
k251714.2%
 
w251714.2%
 
t198133.3%
 
y180393.0%
 
d163492.7%
 
g132402.2%
 
m100831.7%
 
M87011.5%
 
u86381.4%
 
I85631.4%
 
/70071.2%
 
s67441.1%
 
G63081.1%
 
P53060.9%
 
-41460.7%
 
C40120.7%
 
Other values (18)273404.6%
 

num_lab_procedures
Real number (ℝ≥0)

Distinct115
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.12973542
Minimum1
Maximum121
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile4
Q131
median44
Q357
95-th percentile74
Maximum121
Range120
Interquartile range (IQR)26

Descriptive statistics

Standard deviation19.93800422
Coefficient of variation (CV)0.4622797711
Kurtosis-0.2967056405
Mean43.12973542
Median Absolute Deviation (MAD)13
Skewness-0.219676608
Sum2295192
Variance397.5240122
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
116623.1%
 
4313592.6%
 
4412502.3%
 
4511872.2%
 
3811702.2%
 
4711482.2%
 
4611462.2%
 
4011302.1%
 
3710872.0%
 
4110732.0%
 
4810602.0%
 
3910582.0%
 
4210432.0%
 
4910191.9%
 
5110071.9%
 
5010071.9%
 
559821.8%
 
359691.8%
 
369641.8%
 
549561.8%
 
569381.8%
 
539331.8%
 
529121.7%
 
579031.7%
 
588791.7%
 
Other values (90)2637449.6%
 
ValueCountFrequency (%) 
116623.1%
 
25681.1%
 
33830.7%
 
42340.4%
 
51640.3%
 
61560.3%
 
71960.4%
 
81730.3%
 
95091.0%
 
104520.8%
 
ValueCountFrequency (%) 
1211< 0.1%
 
1201< 0.1%
 
1181< 0.1%
 
1141< 0.1%
 
1132< 0.1%
 
1112< 0.1%
 
1091< 0.1%
 
1083< 0.1%
 
1071< 0.1%
 
1064< 0.1%
 

num_procedures
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.460331479
Minimum0
Maximum6
Zeros22882
Zeros (%)43.0%
Memory size415.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.768794512
Coefficient of variation (CV)1.211228093
Kurtosis0.470195194
Mean1.460331479
Median Absolute Deviation (MAD)1
Skewness1.194358286
Sum77713
Variance3.128634027
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
02288243.0%
 
11069720.1%
 
2697113.1%
 
3550310.3%
 
630225.7%
 
422724.3%
 
518693.5%
 
ValueCountFrequency (%) 
02288243.0%
 
11069720.1%
 
2697113.1%
 
3550310.3%
 
422724.3%
 
518693.5%
 
630225.7%
 
ValueCountFrequency (%) 
630225.7%
 
518693.5%
 
422724.3%
 
3550310.3%
 
2697113.1%
 
11069720.1%
 
02288243.0%
 

num_medications
Real number (ℝ≥0)

Distinct73
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.57514657
Minimum1
Maximum81
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile5
Q110
median14
Q320
95-th percentile31
Maximum81
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.390698654
Coefficient of variation (CV)0.5387235757
Kurtosis3.766971434
Mean15.57514657
Median Absolute Deviation (MAD)5
Skewness1.444972846
Sum828847
Variance70.4038239
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1232006.0%
 
1331565.9%
 
1130405.7%
 
1029905.6%
 
1429535.5%
 
1529305.5%
 
927775.2%
 
1627105.1%
 
825164.7%
 
1724504.6%
 
1821514.0%
 
720673.9%
 
1919653.7%
 
2017603.3%
 
616713.1%
 
2114832.8%
 
2213472.5%
 
512882.4%
 
2311642.2%
 
249461.8%
 
259261.7%
 
49231.7%
 
267781.5%
 
276801.3%
 
35931.1%
 
Other values (48)47528.9%
 
ValueCountFrequency (%) 
11810.3%
 
23250.6%
 
35931.1%
 
49231.7%
 
512882.4%
 
616713.1%
 
720673.9%
 
825164.7%
 
927775.2%
 
1029905.6%
 
ValueCountFrequency (%) 
811< 0.1%
 
791< 0.1%
 
752< 0.1%
 
741< 0.1%
 
692< 0.1%
 
682< 0.1%
 
671< 0.1%
 
664< 0.1%
 
658< 0.1%
 
645< 0.1%
 

number_outpatient
Real number (ℝ≥0)

ZEROS

Distinct29
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2658223091
Minimum0
Maximum36
Zeros46621
Zeros (%)87.6%
Memory size415.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum36
Range36
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.019399149
Coefficient of variation (CV)3.834889374
Kurtosis163.6847484
Mean0.2658223091
Median Absolute Deviation (MAD)0
Skewness9.167709296
Sum14146
Variance1.039174624
MonotocityNot monotonic
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%) 
04662187.6%
 
134226.4%
 
214302.7%
 
38201.5%
 
44270.8%
 
52030.4%
 
6920.2%
 
7460.1%
 
8450.1%
 
9270.1%
 
1016< 0.1%
 
1113< 0.1%
 
1210< 0.1%
 
138< 0.1%
 
158< 0.1%
 
146< 0.1%
 
165< 0.1%
 
174< 0.1%
 
182< 0.1%
 
202< 0.1%
 
221< 0.1%
 
251< 0.1%
 
331< 0.1%
 
241< 0.1%
 
191< 0.1%
 
Other values (4)4< 0.1%
 
ValueCountFrequency (%) 
04662187.6%
 
134226.4%
 
214302.7%
 
38201.5%
 
44270.8%
 
52030.4%
 
6920.2%
 
7460.1%
 
8450.1%
 
9270.1%
 
ValueCountFrequency (%) 
361< 0.1%
 
351< 0.1%
 
331< 0.1%
 
291< 0.1%
 
261< 0.1%
 
251< 0.1%
 
241< 0.1%
 
221< 0.1%
 
202< 0.1%
 
191< 0.1%
 

number_emergency
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09061184606
Minimum0
Maximum42
Zeros49784
Zeros (%)93.6%
Memory size415.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum42
Range42
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4967822738
Coefficient of variation (CV)5.482531208
Kurtosis1741.964029
Mean0.09061184606
Median Absolute Deviation (MAD)0
Skewness27.22616943
Sum4822
Variance0.2467926276
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
04978493.6%
 
126715.0%
 
24950.9%
 
31540.3%
 
4490.1%
 
522< 0.1%
 
618< 0.1%
 
86< 0.1%
 
75< 0.1%
 
104< 0.1%
 
93< 0.1%
 
251< 0.1%
 
201< 0.1%
 
111< 0.1%
 
421< 0.1%
 
371< 0.1%
 
ValueCountFrequency (%) 
04978493.6%
 
126715.0%
 
24950.9%
 
31540.3%
 
4490.1%
 
522< 0.1%
 
618< 0.1%
 
75< 0.1%
 
86< 0.1%
 
93< 0.1%
 
ValueCountFrequency (%) 
421< 0.1%
 
371< 0.1%
 
251< 0.1%
 
201< 0.1%
 
111< 0.1%
 
104< 0.1%
 
93< 0.1%
 
86< 0.1%
 
75< 0.1%
 
618< 0.1%
 

number_inpatient
Real number (ℝ≥0)

ZEROS

Distinct13
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1696482261
Minimum0
Maximum12
Zeros47199
Zeros (%)88.7%
Memory size415.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum12
Range12
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5882180591
Coefficient of variation (CV)3.46728093
Kurtosis50.67286468
Mean0.1696482261
Median Absolute Deviation (MAD)0
Skewness5.711601936
Sum9028
Variance0.3460004851
MonotocityNot monotonic
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%) 
04719988.7%
 
142488.0%
 
211152.1%
 
33530.7%
 
41670.3%
 
5620.1%
 
6390.1%
 
711< 0.1%
 
810< 0.1%
 
104< 0.1%
 
94< 0.1%
 
122< 0.1%
 
112< 0.1%
 
ValueCountFrequency (%) 
04719988.7%
 
142488.0%
 
211152.1%
 
33530.7%
 
41670.3%
 
5620.1%
 
6390.1%
 
711< 0.1%
 
810< 0.1%
 
94< 0.1%
 
ValueCountFrequency (%) 
122< 0.1%
 
112< 0.1%
 
104< 0.1%
 
94< 0.1%
 
810< 0.1%
 
711< 0.1%
 
6390.1%
 
5620.1%
 
41670.3%
 
33530.7%
 

diag_1
Categorical

HIGH CARDINALITY

Distinct671
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
414
4102 
428
 
2554
786
 
2381
410
 
2169
486
 
1704
Other values (666)
40306 
ValueCountFrequency (%) 
41441027.7%
 
42825544.8%
 
78623814.5%
 
41021694.1%
 
48617043.2%
 
42715953.0%
 
71515823.0%
 
43411082.1%
 
68210942.1%
 
78010662.0%
 
4919061.7%
 
2768401.6%
 
9968351.6%
 
250.87771.5%
 
387491.4%
 
5997121.3%
 
5846851.3%
 
5746121.2%
 
8206051.1%
 
4355801.1%
 
7225601.1%
 
5625431.0%
 
V575361.0%
 
5775111.0%
 
2965020.9%
 
Other values (646)2390844.9%
 
Frequencies of value counts

Unique

Unique98 ?
Unique (%)0.2%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.165175887
Min length1

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
42965917.6%
 
22055512.2%
 
51919111.4%
 
81867611.1%
 
1160999.6%
 
7156649.3%
 
0130567.8%
 
6122687.3%
 
992685.5%
 
392265.5%
 
.40422.4%
 
V7330.4%
 
E1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number16366297.2%
 
Other Punctuation40422.4%
 
Uppercase Letter7340.4%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
42965918.1%
 
22055512.6%
 
51919111.7%
 
81867611.4%
 
1160999.8%
 
7156649.6%
 
0130568.0%
 
6122687.5%
 
992685.7%
 
392265.6%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.4042100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V73399.9%
 
E10.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Common16770499.6%
 
Latin7340.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
42965917.7%
 
22055512.3%
 
51919111.4%
 
81867611.1%
 
1160999.6%
 
7156649.3%
 
0130567.8%
 
6122687.3%
 
992685.5%
 
392265.5%
 
.40422.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V73399.9%
 
E10.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII168438100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
42965917.6%
 
22055512.2%
 
51919111.4%
 
81867611.1%
 
1160999.6%
 
7156649.3%
 
0130567.8%
 
6122687.3%
 
992685.5%
 
392265.5%
 
.40422.4%
 
V7330.4%
 
E1< 0.1%
 

diag_2
Categorical

HIGH CARDINALITY

Distinct697
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
250
4120 
276
 
3317
428
 
3014
427
 
2560
401
 
2548
Other values (692)
37657 
ValueCountFrequency (%) 
25041207.7%
 
27633176.2%
 
42830145.7%
 
42725604.8%
 
40125484.8%
 
59916583.1%
 
41115953.0%
 
41415833.0%
 
49615692.9%
 
250.0211252.1%
 
40310331.9%
 
250.018651.6%
 
2858581.6%
 
7078021.5%
 
7807691.4%
 
5847611.4%
 
6826631.2%
 
5856581.2%
 
4866361.2%
 
4136351.2%
 
4256321.2%
 
4246281.2%
 
5186051.1%
 
4916011.1%
 
4934650.9%
 
Other values (672)1951636.7%
 
Frequencies of value counts

Unique

Unique132 ?
Unique (%)0.2%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.180753909
Min length1

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
22669715.8%
 
42660915.7%
 
52002511.8%
 
01881811.1%
 
1148398.8%
 
7145328.6%
 
8142738.4%
 
9112596.7%
 
6101786.0%
 
370534.2%
 
.35592.1%
 
V9590.6%
 
E4660.3%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number16428397.1%
 
Other Punctuation35592.1%
 
Uppercase Letter14250.8%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
22669716.3%
 
42660916.2%
 
52002512.2%
 
01881811.5%
 
1148399.0%
 
7145328.8%
 
8142738.7%
 
9112596.9%
 
6101786.2%
 
370534.3%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.3559100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V95967.3%
 
E46632.7%
 

Most occurring scripts

ValueCountFrequency (%) 
Common16784299.2%
 
Latin14250.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
22669715.9%
 
42660915.9%
 
52002511.9%
 
01881811.2%
 
1148398.8%
 
7145328.7%
 
8142738.5%
 
9112596.7%
 
6101786.1%
 
370534.2%
 
.35592.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V95967.3%
 
E46632.7%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII169267100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
22669715.8%
 
42660915.7%
 
52002511.8%
 
01881811.1%
 
1148398.8%
 
7145328.6%
 
8142738.4%
 
9112596.7%
 
6101786.0%
 
370534.2%
 
.35592.1%
 
V9590.6%
 
E4660.3%
 

diag_3
Categorical

HIGH CARDINALITY

Distinct724
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
250
7578 
401
5268 
276
 
2552
414
 
2035
427
 
1927
Other values (719)
33856 
ValueCountFrequency (%) 
250757814.2%
 
40152689.9%
 
27625524.8%
 
41420353.8%
 
42719273.6%
 
42819183.6%
 
27213502.5%
 
49611032.1%
 
5999211.7%
 
4038631.6%
 
250.026911.3%
 
7806661.3%
 
2856581.2%
 
5856421.2%
 
V456341.2%
 
4245991.1%
 
3055531.0%
 
250.015351.0%
 
7075311.0%
 
4255171.0%
 
2784470.8%
 
250.64420.8%
 
414230.8%
 
5844150.8%
 
6823860.7%
 
Other values (699)1956236.8%
 
Frequencies of value counts

Unique

Unique119 ?
Unique (%)0.2%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.136556675
Min length1

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
22821416.9%
 
42554415.3%
 
02270613.6%
 
52174013.0%
 
1140908.4%
 
7136538.2%
 
8115376.9%
 
986745.2%
 
680994.9%
 
371634.3%
 
.27591.7%
 
V19871.2%
 
E7490.4%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number16142096.7%
 
Other Punctuation27591.7%
 
Uppercase Letter27361.6%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
22821417.5%
 
42554415.8%
 
02270614.1%
 
52174013.5%
 
1140908.7%
 
7136538.5%
 
8115377.1%
 
986745.4%
 
680995.0%
 
371634.4%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.2759100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V198772.6%
 
E74927.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Common16417998.4%
 
Latin27361.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
22821417.2%
 
42554415.6%
 
02270613.8%
 
52174013.2%
 
1140908.6%
 
7136538.3%
 
8115377.0%
 
986745.3%
 
680994.9%
 
371634.4%
 
.27591.7%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V198772.6%
 
E74927.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII166915100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
22821416.9%
 
42554415.3%
 
02270613.6%
 
52174013.0%
 
1140908.4%
 
7136538.2%
 
8115376.9%
 
986745.2%
 
680994.9%
 
371634.3%
 
.27591.7%
 
V19871.2%
 
E7490.4%
 

number_diagnoses
Real number (ℝ≥0)

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.153337342
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size415.9 KiB

Quantile statistics

Minimum1
5-th percentile3
Q15
median8
Q39
95-th percentile9
Maximum16
Range15
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.030467766
Coefficient of variation (CV)0.283849016
Kurtosis-0.4380369826
Mean7.153337342
Median Absolute Deviation (MAD)1
Skewness-0.6735148769
Sum380672
Variance4.122799348
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
92266142.6%
 
5706813.3%
 
6589511.1%
 
7565210.6%
 
8551510.4%
 
435426.7%
 
319203.6%
 
27341.4%
 
11760.3%
 
1626< 0.1%
 
138< 0.1%
 
155< 0.1%
 
114< 0.1%
 
104< 0.1%
 
143< 0.1%
 
123< 0.1%
 
ValueCountFrequency (%) 
11760.3%
 
27341.4%
 
319203.6%
 
435426.7%
 
5706813.3%
 
6589511.1%
 
7565210.6%
 
8551510.4%
 
92266142.6%
 
104< 0.1%
 
ValueCountFrequency (%) 
1626< 0.1%
 
155< 0.1%
 
143< 0.1%
 
138< 0.1%
 
123< 0.1%
 
114< 0.1%
 
104< 0.1%
 
92266142.6%
 
8551510.4%
 
7565210.6%
 

max_glu_serum
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
None
50853 
Norm
 
1231
>200
 
675
>300
 
457
ValueCountFrequency (%) 
None5085395.6%
 
Norm12312.3%
 
>2006751.3%
 
>3004570.9%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length4
Median length4
Mean length4
Min length4

Overview of Unicode Properties

Unique unicode characters10
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5208424.5%
 
o5208424.5%
 
n5085323.9%
 
e5085323.9%
 
022641.1%
 
r12310.6%
 
m12310.6%
 
>11320.5%
 
26750.3%
 
34570.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter15625273.4%
 
Uppercase Letter5208424.5%
 
Decimal Number33961.6%
 
Math Symbol11320.5%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N52084100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5208433.3%
 
n5085332.5%
 
e5085332.5%
 
r12310.8%
 
m12310.8%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>1132100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
0226466.7%
 
267519.9%
 
345713.5%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin20833697.9%
 
Common45282.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5208425.0%
 
o5208425.0%
 
n5085324.4%
 
e5085324.4%
 
r12310.6%
 
m12310.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
0226450.0%
 
>113225.0%
 
267514.9%
 
345710.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII212864100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5208424.5%
 
o5208424.5%
 
n5085323.9%
 
e5085323.9%
 
022641.1%
 
r12310.6%
 
m12310.6%
 
>11320.5%
 
26750.3%
 
34570.2%
 

A1Cresult
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
None
43216 
>8
4797 
Norm
 
2974
>7
 
2229
ValueCountFrequency (%) 
None4321681.2%
 
>847979.0%
 
Norm29745.6%
 
>722294.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length4
Median length4
Mean length3.735944077
Min length2

Overview of Unicode Properties

Unique unicode characters9
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N4619023.2%
 
o4619023.2%
 
n4321621.7%
 
e4321621.7%
 
>70263.5%
 
847972.4%
 
r29741.5%
 
m29741.5%
 
722291.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter13857069.7%
 
Uppercase Letter4619023.2%
 
Math Symbol70263.5%
 
Decimal Number70263.5%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N46190100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4619033.3%
 
n4321631.2%
 
e4321631.2%
 
r29742.1%
 
m29742.1%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>7026100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
8479768.3%
 
7222931.7%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin18476092.9%
 
Common140527.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N4619025.0%
 
o4619025.0%
 
n4321623.4%
 
e4321623.4%
 
r29741.6%
 
m29741.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
>702650.0%
 
8479734.1%
 
7222915.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII198812100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N4619023.2%
 
o4619023.2%
 
n4321621.7%
 
e4321621.7%
 
>70263.5%
 
847972.4%
 
r29741.5%
 
m29741.5%
 
722291.1%
 

metformin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
41657 
Steady
10555 
Up
 
657
Down
 
347
ValueCountFrequency (%) 
No4165778.3%
 
Steady1055519.8%
 
Up6571.2%
 
Down3470.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.806411606
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o4200428.1%
 
N4165727.9%
 
S105557.1%
 
t105557.1%
 
e105557.1%
 
a105557.1%
 
d105557.1%
 
y105557.1%
 
U6570.4%
 
p6570.4%
 
D3470.2%
 
w3470.2%
 
n3470.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter9613064.4%
 
Uppercase Letter5321635.6%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4165778.3%
 
S1055519.8%
 
U6571.2%
 
D3470.7%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4200443.7%
 
t1055511.0%
 
e1055511.0%
 
a1055511.0%
 
d1055511.0%
 
y1055511.0%
 
p6570.7%
 
w3470.4%
 
n3470.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin149346100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o4200428.1%
 
N4165727.9%
 
S105557.1%
 
t105557.1%
 
e105557.1%
 
a105557.1%
 
d105557.1%
 
y105557.1%
 
U6570.4%
 
p6570.4%
 
D3470.2%
 
w3470.2%
 
n3470.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII149346100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o4200428.1%
 
N4165727.9%
 
S105557.1%
 
t105557.1%
 
e105557.1%
 
a105557.1%
 
d105557.1%
 
y105557.1%
 
U6570.4%
 
p6570.4%
 
D3470.2%
 
w3470.2%
 
n3470.2%
 

repaglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
52575 
Steady
 
565
Up
 
53
Down
 
23
ValueCountFrequency (%) 
No5257598.8%
 
Steady5651.1%
 
Up530.1%
 
Down23< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.043332832
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5259848.4%
 
N5257548.4%
 
S5650.5%
 
t5650.5%
 
e5650.5%
 
a5650.5%
 
d5650.5%
 
y5650.5%
 
U53< 0.1%
 
p53< 0.1%
 
D23< 0.1%
 
w23< 0.1%
 
n23< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5552251.1%
 
Uppercase Letter5321648.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5257598.8%
 
S5651.1%
 
U530.1%
 
D23< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5259894.7%
 
t5651.0%
 
e5651.0%
 
a5651.0%
 
d5651.0%
 
y5651.0%
 
p530.1%
 
w23< 0.1%
 
n23< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin108738100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5259848.4%
 
N5257548.4%
 
S5650.5%
 
t5650.5%
 
e5650.5%
 
a5650.5%
 
d5650.5%
 
y5650.5%
 
U53< 0.1%
 
p53< 0.1%
 
D23< 0.1%
 
w23< 0.1%
 
n23< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII108738100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5259848.4%
 
N5257548.4%
 
S5650.5%
 
t5650.5%
 
e5650.5%
 
a5650.5%
 
d5650.5%
 
y5650.5%
 
U53< 0.1%
 
p53< 0.1%
 
D23< 0.1%
 
w23< 0.1%
 
n23< 0.1%
 

nateglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
52846 
Steady
 
350
Up
 
14
Down
 
6
ValueCountFrequency (%) 
No5284699.3%
 
Steady3500.7%
 
Up14< 0.1%
 
Down6< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.026533373
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5285249.0%
 
N5284649.0%
 
S3500.3%
 
t3500.3%
 
e3500.3%
 
a3500.3%
 
d3500.3%
 
y3500.3%
 
U14< 0.1%
 
p14< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5462850.7%
 
Uppercase Letter5321649.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5284699.3%
 
S3500.7%
 
U14< 0.1%
 
D6< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5285296.7%
 
t3500.6%
 
e3500.6%
 
a3500.6%
 
d3500.6%
 
y3500.6%
 
p14< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin107844100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5285249.0%
 
N5284649.0%
 
S3500.3%
 
t3500.3%
 
e3500.3%
 
a3500.3%
 
d3500.3%
 
y3500.3%
 
U14< 0.1%
 
p14< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII107844100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5285249.0%
 
N5284649.0%
 
S3500.3%
 
t3500.3%
 
e3500.3%
 
a3500.3%
 
d3500.3%
 
y3500.3%
 
U14< 0.1%
 
p14< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

chlorpropamide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53159 
Steady
 
53
Up
 
3
Down
 
1
ValueCountFrequency (%) 
No5315999.9%
 
Steady530.1%
 
Up3< 0.1%
 
Down1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.004021347
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5316049.8%
 
N5315949.8%
 
S53< 0.1%
 
t53< 0.1%
 
e53< 0.1%
 
a53< 0.1%
 
d53< 0.1%
 
y53< 0.1%
 
U3< 0.1%
 
p3< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5343050.1%
 
Uppercase Letter5321649.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5315999.9%
 
S530.1%
 
U3< 0.1%
 
D1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5316099.5%
 
t530.1%
 
e530.1%
 
a530.1%
 
d530.1%
 
y530.1%
 
p3< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106646100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5316049.8%
 
N5315949.8%
 
S53< 0.1%
 
t53< 0.1%
 
e53< 0.1%
 
a53< 0.1%
 
d53< 0.1%
 
y53< 0.1%
 
U3< 0.1%
 
p3< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106646100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5316049.8%
 
N5315949.8%
 
S53< 0.1%
 
t53< 0.1%
 
e53< 0.1%
 
a53< 0.1%
 
d53< 0.1%
 
y53< 0.1%
 
U3< 0.1%
 
p3< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

glimepiride
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
50383 
Steady
 
2551
Up
 
181
Down
 
101
ValueCountFrequency (%) 
No5038394.7%
 
Steady25514.8%
 
Up1810.3%
 
Down1010.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.195542694
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5048443.2%
 
N5038343.1%
 
S25512.2%
 
t25512.2%
 
e25512.2%
 
a25512.2%
 
d25512.2%
 
y25512.2%
 
U1810.2%
 
p1810.2%
 
D1010.1%
 
w1010.1%
 
n1010.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter6362254.5%
 
Uppercase Letter5321645.5%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5038394.7%
 
S25514.8%
 
U1810.3%
 
D1010.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5048479.3%
 
t25514.0%
 
e25514.0%
 
a25514.0%
 
d25514.0%
 
y25514.0%
 
p1810.3%
 
w1010.2%
 
n1010.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin116838100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5048443.2%
 
N5038343.1%
 
S25512.2%
 
t25512.2%
 
e25512.2%
 
a25512.2%
 
d25512.2%
 
y25512.2%
 
U1810.2%
 
p1810.2%
 
D1010.1%
 
w1010.1%
 
n1010.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII116838100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5048443.2%
 
N5038343.1%
 
S25512.2%
 
t25512.2%
 
e25512.2%
 
a25512.2%
 
d25512.2%
 
y25512.2%
 
U1810.2%
 
p1810.2%
 
D1010.1%
 
w1010.1%
 
n1010.1%
 

acetohexamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53215 
Steady
 
1
ValueCountFrequency (%) 
No53215> 99.9%
 
Steady1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000075165
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5322050.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53215> 99.9%
 
S1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53215> 99.9%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106436100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106436100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

glipizide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
46510 
Steady
6021 
Up
 
438
Down
 
247
ValueCountFrequency (%) 
No4651087.4%
 
Steady602111.3%
 
Up4380.8%
 
Down2470.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.461853578
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o4675735.7%
 
N4651035.5%
 
S60214.6%
 
t60214.6%
 
e60214.6%
 
a60214.6%
 
d60214.6%
 
y60214.6%
 
U4380.3%
 
p4380.3%
 
D2470.2%
 
w2470.2%
 
n2470.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter7779459.4%
 
Uppercase Letter5321640.6%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4651087.4%
 
S602111.3%
 
U4380.8%
 
D2470.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4675760.1%
 
t60217.7%
 
e60217.7%
 
a60217.7%
 
d60217.7%
 
y60217.7%
 
p4380.6%
 
w2470.3%
 
n2470.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin131010100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o4675735.7%
 
N4651035.5%
 
S60214.6%
 
t60214.6%
 
e60214.6%
 
a60214.6%
 
d60214.6%
 
y60214.6%
 
U4380.3%
 
p4380.3%
 
D2470.2%
 
w2470.2%
 
n2470.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII131010100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o4675735.7%
 
N4651035.5%
 
S60214.6%
 
t60214.6%
 
e60214.6%
 
a60214.6%
 
d60214.6%
 
y60214.6%
 
U4380.3%
 
p4380.3%
 
D2470.2%
 
w2470.2%
 
n2470.2%
 

glyburide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
47401 
Steady
5044 
Up
 
463
Down
 
308
ValueCountFrequency (%) 
No4740189.1%
 
Steady50449.5%
 
Up4630.9%
 
Down3080.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.390709561
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o4770937.5%
 
N4740137.3%
 
S50444.0%
 
t50444.0%
 
e50444.0%
 
a50444.0%
 
d50444.0%
 
y50444.0%
 
U4630.4%
 
p4630.4%
 
D3080.2%
 
w3080.2%
 
n3080.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter7400858.2%
 
Uppercase Letter5321641.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4740189.1%
 
S50449.5%
 
U4630.9%
 
D3080.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4770964.5%
 
t50446.8%
 
e50446.8%
 
a50446.8%
 
d50446.8%
 
y50446.8%
 
p4630.6%
 
w3080.4%
 
n3080.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin127224100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o4770937.5%
 
N4740137.3%
 
S50444.0%
 
t50444.0%
 
e50444.0%
 
a50444.0%
 
d50444.0%
 
y50444.0%
 
U4630.4%
 
p4630.4%
 
D3080.2%
 
w3080.2%
 
n3080.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII127224100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o4770937.5%
 
N4740137.3%
 
S50444.0%
 
t50444.0%
 
e50444.0%
 
a50444.0%
 
d50444.0%
 
y50444.0%
 
U4630.4%
 
p4630.4%
 
D3080.2%
 
w3080.2%
 
n3080.2%
 

tolbutamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53201 
Steady
 
15
ValueCountFrequency (%) 
No53201> 99.9%
 
Steady15< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.00112748
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5320150.0%
 
o5320150.0%
 
S15< 0.1%
 
t15< 0.1%
 
e15< 0.1%
 
a15< 0.1%
 
d15< 0.1%
 
y15< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5327650.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53201> 99.9%
 
S15< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5320199.9%
 
t15< 0.1%
 
e15< 0.1%
 
a15< 0.1%
 
d15< 0.1%
 
y15< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106492100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5320150.0%
 
o5320150.0%
 
S15< 0.1%
 
t15< 0.1%
 
e15< 0.1%
 
a15< 0.1%
 
d15< 0.1%
 
y15< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106492100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5320150.0%
 
o5320150.0%
 
S15< 0.1%
 
t15< 0.1%
 
e15< 0.1%
 
a15< 0.1%
 
d15< 0.1%
 
y15< 0.1%
 

pioglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
49215 
Steady
 
3805
Up
 
135
Down
 
61
ValueCountFrequency (%) 
No4921592.5%
 
Steady38057.2%
 
Up1350.3%
 
Down610.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.288296753
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o4927640.5%
 
N4921540.4%
 
S38053.1%
 
t38053.1%
 
e38053.1%
 
a38053.1%
 
d38053.1%
 
y38053.1%
 
U1350.1%
 
p1350.1%
 
D610.1%
 
w610.1%
 
n610.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter6855856.3%
 
Uppercase Letter5321643.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4921592.5%
 
S38057.2%
 
U1350.3%
 
D610.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4927671.9%
 
t38055.6%
 
e38055.6%
 
a38055.6%
 
d38055.6%
 
y38055.6%
 
p1350.2%
 
w610.1%
 
n610.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin121774100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o4927640.5%
 
N4921540.4%
 
S38053.1%
 
t38053.1%
 
e38053.1%
 
a38053.1%
 
d38053.1%
 
y38053.1%
 
U1350.1%
 
p1350.1%
 
D610.1%
 
w610.1%
 
n610.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII121774100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o4927640.5%
 
N4921540.4%
 
S38053.1%
 
t38053.1%
 
e38053.1%
 
a38053.1%
 
d38053.1%
 
y38053.1%
 
U1350.1%
 
p1350.1%
 
D610.1%
 
w610.1%
 
n610.1%
 

rosiglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
49765 
Steady
 
3291
Up
 
100
Down
 
60
ValueCountFrequency (%) 
No4976593.5%
 
Steady32916.2%
 
Up1000.2%
 
Down600.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.249624173
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o4982541.6%
 
N4976541.6%
 
S32912.7%
 
t32912.7%
 
e32912.7%
 
a32912.7%
 
d32912.7%
 
y32912.7%
 
U1000.1%
 
p1000.1%
 
D600.1%
 
w600.1%
 
n600.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter6650055.5%
 
Uppercase Letter5321644.5%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4976593.5%
 
S32916.2%
 
U1000.2%
 
D600.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o4982574.9%
 
t32914.9%
 
e32914.9%
 
a32914.9%
 
d32914.9%
 
y32914.9%
 
p1000.2%
 
w600.1%
 
n600.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin119716100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o4982541.6%
 
N4976541.6%
 
S32912.7%
 
t32912.7%
 
e32912.7%
 
a32912.7%
 
d32912.7%
 
y32912.7%
 
U1000.1%
 
p1000.1%
 
D600.1%
 
w600.1%
 
n600.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII119716100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o4982541.6%
 
N4976541.6%
 
S32912.7%
 
t32912.7%
 
e32912.7%
 
a32912.7%
 
d32912.7%
 
y32912.7%
 
U1000.1%
 
p1000.1%
 
D600.1%
 
w600.1%
 
n600.1%
 

acarbose
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53077 
Steady
 
132
Up
 
7
ValueCountFrequency (%) 
No5307799.7%
 
Steady1320.2%
 
Up7< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.009921828
Min length2

Overview of Unicode Properties

Unique unicode characters10
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5307749.6%
 
o5307749.6%
 
S1320.1%
 
t1320.1%
 
e1320.1%
 
a1320.1%
 
d1320.1%
 
y1320.1%
 
U7< 0.1%
 
p7< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5374450.2%
 
Uppercase Letter5321649.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5307799.7%
 
S1320.2%
 
U7< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5307798.8%
 
t1320.2%
 
e1320.2%
 
a1320.2%
 
d1320.2%
 
y1320.2%
 
p7< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106960100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5307749.6%
 
o5307749.6%
 
S1320.1%
 
t1320.1%
 
e1320.1%
 
a1320.1%
 
d1320.1%
 
y1320.1%
 
U7< 0.1%
 
p7< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106960100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5307749.6%
 
o5307749.6%
 
S1320.1%
 
t1320.1%
 
e1320.1%
 
a1320.1%
 
d1320.1%
 
y1320.1%
 
U7< 0.1%
 
p7< 0.1%
 

miglitol
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53206 
Steady
 
10
ValueCountFrequency (%) 
No53206> 99.9%
 
Steady10< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000751654
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5320650.0%
 
o5320650.0%
 
S10< 0.1%
 
t10< 0.1%
 
e10< 0.1%
 
a10< 0.1%
 
d10< 0.1%
 
y10< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5325650.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53206> 99.9%
 
S10< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5320699.9%
 
t10< 0.1%
 
e10< 0.1%
 
a10< 0.1%
 
d10< 0.1%
 
y10< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106472100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5320650.0%
 
o5320650.0%
 
S10< 0.1%
 
t10< 0.1%
 
e10< 0.1%
 
a10< 0.1%
 
d10< 0.1%
 
y10< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106472100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5320650.0%
 
o5320650.0%
 
S10< 0.1%
 
t10< 0.1%
 
e10< 0.1%
 
a10< 0.1%
 
d10< 0.1%
 
y10< 0.1%
 

troglitazone
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53214 
Steady
 
2
ValueCountFrequency (%) 
No53214> 99.9%
 
Steady2< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000150331
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5322450.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53214> 99.9%
 
S2< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53214> 99.9%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106440100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106440100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

tolazamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53192 
Steady
 
24
ValueCountFrequency (%) 
No53192> 99.9%
 
Steady24< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.001803969
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5319249.9%
 
o5319249.9%
 
S24< 0.1%
 
t24< 0.1%
 
e24< 0.1%
 
a24< 0.1%
 
d24< 0.1%
 
y24< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5331250.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53192> 99.9%
 
S24< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5319299.8%
 
t24< 0.1%
 
e24< 0.1%
 
a24< 0.1%
 
d24< 0.1%
 
y24< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106528100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5319249.9%
 
o5319249.9%
 
S24< 0.1%
 
t24< 0.1%
 
e24< 0.1%
 
a24< 0.1%
 
d24< 0.1%
 
y24< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106528100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5319249.9%
 
o5319249.9%
 
S24< 0.1%
 
t24< 0.1%
 
e24< 0.1%
 
a24< 0.1%
 
d24< 0.1%
 
y24< 0.1%
 

insulin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
26502 
Steady
16690 
Down
5207 
Up
4817 
ValueCountFrequency (%) 
No2650249.8%
 
Steady1669031.4%
 
Down52079.8%
 
Up48179.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length3.450202946
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o3170917.3%
 
N2650214.4%
 
S166909.1%
 
t166909.1%
 
e166909.1%
 
a166909.1%
 
d166909.1%
 
y166909.1%
 
D52072.8%
 
w52072.8%
 
n52072.8%
 
U48172.6%
 
p48172.6%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter13039071.0%
 
Uppercase Letter5321629.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N2650249.8%
 
S1669031.4%
 
D52079.8%
 
U48179.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o3170924.3%
 
t1669012.8%
 
e1669012.8%
 
a1669012.8%
 
d1669012.8%
 
y1669012.8%
 
w52074.0%
 
n52074.0%
 
p48173.7%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin183606100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o3170917.3%
 
N2650214.4%
 
S166909.1%
 
t166909.1%
 
e166909.1%
 
a166909.1%
 
d166909.1%
 
y166909.1%
 
D52072.8%
 
w52072.8%
 
n52072.8%
 
U48172.6%
 
p48172.6%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII183606100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o3170917.3%
 
N2650214.4%
 
S166909.1%
 
t166909.1%
 
e166909.1%
 
a166909.1%
 
d166909.1%
 
y166909.1%
 
D52072.8%
 
w52072.8%
 
n52072.8%
 
U48172.6%
 
p48172.6%
 
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
52855 
Steady
 
351
Up
 
7
Down
 
3
ValueCountFrequency (%) 
No5285599.3%
 
Steady3510.7%
 
Up7< 0.1%
 
Down3< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.026495791
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5285849.0%
 
N5285549.0%
 
S3510.3%
 
t3510.3%
 
e3510.3%
 
a3510.3%
 
d3510.3%
 
y3510.3%
 
U7< 0.1%
 
p7< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5462650.7%
 
Uppercase Letter5321649.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5285599.3%
 
S3510.7%
 
U7< 0.1%
 
D3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5285896.8%
 
t3510.6%
 
e3510.6%
 
a3510.6%
 
d3510.6%
 
y3510.6%
 
p7< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin107842100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5285849.0%
 
N5285549.0%
 
S3510.3%
 
t3510.3%
 
e3510.3%
 
a3510.3%
 
d3510.3%
 
y3510.3%
 
U7< 0.1%
 
p7< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII107842100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5285849.0%
 
N5285549.0%
 
S3510.3%
 
t3510.3%
 
e3510.3%
 
a3510.3%
 
d3510.3%
 
y3510.3%
 
U7< 0.1%
 
p7< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53213 
Steady
 
3
ValueCountFrequency (%) 
No53213> 99.9%
 
Steady3< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000225496
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321350.0%
 
o5321350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5322850.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53213> 99.9%
 
S3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53213> 99.9%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106444100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321350.0%
 
o5321350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106444100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321350.0%
 
o5321350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

glimepiride-pioglitazone
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53216 
ValueCountFrequency (%) 
No53216100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters2
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321650.0%
 
o5321650.0%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter5321650.0%
 
Lowercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53216100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53216100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106432100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321650.0%
 
o5321650.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106432100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321650.0%
 
o5321650.0%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53214 
Steady
 
2
ValueCountFrequency (%) 
No53214> 99.9%
 
Steady2< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000150331
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5322450.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53214> 99.9%
 
S2< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53214> 99.9%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106440100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106440100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321450.0%
 
o5321450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
53215 
Steady
 
1
ValueCountFrequency (%) 
No53215> 99.9%
 
Steady1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000075165
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5322050.0%
 
Uppercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N53215> 99.9%
 
S1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o53215> 99.9%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106436100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106436100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5321550.0%
 
o5321550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

change
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
No
29789 
Ch
23427 
ValueCountFrequency (%) 
No2978956.0%
 
Ch2342744.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N2978928.0%
 
o2978928.0%
 
C2342722.0%
 
h2342722.0%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter5321650.0%
 
Lowercase Letter5321650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N2978956.0%
 
C2342744.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o2978956.0%
 
h2342744.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin106432100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N2978928.0%
 
o2978928.0%
 
C2342722.0%
 
h2342722.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII106432100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N2978928.0%
 
o2978928.0%
 
C2342722.0%
 
h2342722.0%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
Yes
39997 
No
13219 
ValueCountFrequency (%) 
Yes3999775.2%
 
No1321924.8%
 

readmitted
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size415.9 KiB
NO
41195 
>30
9868 
<30
 
2153
ValueCountFrequency (%) 
NO4119577.4%
 
>30986818.5%
 
<3021534.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length3
Median length2
Mean length2.22589071
Min length2

Overview of Unicode Properties

Unique unicode characters6
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N4119534.8%
 
O4119534.8%
 
31202110.1%
 
01202110.1%
 
>98688.3%
 
<21531.8%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter8239069.6%
 
Decimal Number2404220.3%
 
Math Symbol1202110.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4119550.0%
 
O4119550.0%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>986882.1%
 
<215317.9%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
31202150.0%
 
01202150.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin8239069.6%
 
Common3606330.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N4119550.0%
 
O4119550.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
31202133.3%
 
01202133.3%
 
>986827.4%
 
<21536.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII118453100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N4119534.8%
 
O4119534.8%
 
31202110.1%
 
01202110.1%
 
>98688.3%
 
<21531.8%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

racegenderageadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalmedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideinsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
0CaucasianFemale[0-10)62511Pediatrics-Endocrinology4101000250.83250.83250.831NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO
1CaucasianFemale[10-20)1173Unknow59018000276250.012559NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYes>30
2AfricanAmericanFemale[20-30)1172Unknow11513201648250V276NoneNoneNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNO
3CaucasianMale[30-40)1172Unknow441160008250.434037NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
4CaucasianMale[40-50)1171Unknow51080001971572505NoneNoneNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
5CaucasianMale[50-60)2123Unknow316160004144112509NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
6CaucasianMale[60-70)3124Unknow70121000414411V457NoneNoneSteadyNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
7CaucasianMale[70-80)1175Unknow730120004284922508NoneNoneNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoYes>30
8CaucasianFemale[80-90)21413Unknow68228000398427388NoneNoneNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
9CaucasianFemale[90-100)33412InternalMedicine333180004341984868NoneNoneNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoSteadyNoNoNoNoNoChYesNO

Last rows

racegenderageadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalmedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideinsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
53206CaucasianFemale[40-50)14714Unknow690160002953052505None>7UpNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoDownNoNoNoNoNoChYes>30
53207CaucasianFemale[70-80)3613Orthopedics271290107154012509NoneNormSteadyNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
53208CaucasianMale[70-80)36113Unknow7766500042442948616NoneNormNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
53209OtherFemale[40-50)3113Unknow13150003487847828NoneNoneSteadyNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
53210OtherMale[40-50)11713Unknow51213000250.87307319NoneNoneSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoDownNoNoNoNoNoChYesNO
53211CaucasianFemale[70-80)1179Unknow50233000574574250.029None>7NoNoNoNoNoNoNoUpNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYes>30
53212OtherFemale[40-50)11714Unknow736260105925995189None>8NoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYes>30
53213OtherFemale[60-70)1172Unknow466171119965854039NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
53214CaucasianFemale[80-90)1175Unknow7612201029283049NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
53215CaucasianMale[70-80)1176Unknow13330005305307879NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO